Number of Common Sites Visited by tV Random Walkers 
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We compute analytically the mean number of common sites, W^it), visited by N independent random 
walkers each of length t and all starting at the origin at t — in d dimensions. We show that in the (TV — d) 
plane, there are three distinct regimes for the asymptotic large t growth of Wtf(t). These three regimes are 
separated by two critical lines d — 2 and d = d c (N) = 2N/(N — 1) in the (TV — d) plane. For d < 2, 
W N (t) ~ t d/2 for large t (the N dependence is only in the prefactor). For 2 < d < d c (N), W N {t) ~ t" 
where the exponent v = TV — d(N — 1) /2 varies with TV and d. For d > d c (N), Wn(1>) — ► const, as t — > oo. 
Exactly at the critical dimensions there are logaritmic corrections: for d = 2, we get Wn(£) ~ t/\hnt] N , while 
for d = d c (N), WN(t) ~ lnt for large t. Our analytical predictions are verified in numerical simulations. 

PACS numbers: 05.40.Fb, 05.40.Jc, 02.50.Cw, 24.60.-k 



Computing the average number of distinct sites visited by 
a single i-step random walker on a rf-dimensional lattice, de- 
noted by Si(t), is by now a classic problem with a variety of 
applications ranging from the annealing of defects in crystals 
to the size of the territory covered by a diffusing animal dur- 
ing the foraging period. First posed and studied by Dvoretzky 
and Erdos in 1951 01 > this problem has been solved exactly 
in a number of papers in the 1960's fy. It is well estab- 
lished (see [4] for a review) that asymptotically for large t, 
Si(t) - t d ' 2 for d < 2, ~ t/\n{t) for d = 2 and - t for 
d > 2. These results have been widely used in a number of 
applications in physic s ||5j -|7|], chemistry [8], metallurgy 

a- 

111!! , and ecology 1121 1 1 311 . In 1992, Larralde and coworkers 
generalized this problem to the case of TV independent ran- 
dom walkers (each of t steps) all starting at the origin of a d- 
dimensional lattice lfl4ll . They computed analytically 5jy(t), 
the mean number of sites visited by at least one of the TV walk- 
ers in d dimensions and found two interesting time scales as- 
sociated with the growth of SN(t). In the ecological context, 
Sn (t) represents the mean size of the territory covered by an 
animal population of size TV. The original results of Larralde 
et. al. have subsequently been corrected ||15|1 . used and gener- 
alised in a number of other applications 




FIG. 1. (color online) A realization of 3 random walks, each of 
6 steps (denoted respectively by solid (blue), dashed (purple), and 
dashed-dotted (green) lines) on a square lattice, all starting at the ori- 
gin O. There are two sites (each marked by a filled (red) circle) that 
are visited by all 3 walkers. 
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In this Letter, we study a complementary question: what 
is the average number of common sites, Wxit), visited by TV 
independent walkers, each of them consisting of t steps and 
starting at the origin of a rf-dimensional lattice? A typical 
realization in d = 2 for TV = 3 walkers is shown in Fig. (HJ. 
Our exact results demonstrate that Wij{t) exhibits a rather 
rich asymptotic behavior for large t. In the (TV — d) plane 
(TV being the number of walkers, or the population size in 
ecological context, and d - the space dimension) we find an 
interesting phase diagram where two critical lines d = 2 and 
d c (N) — 2TV/(TV — 1) separate three phases with different 
asymptotic growth of Wjv(t) (see Fig. (01). For large t, we 



W N (t)~t d/2 for d < 2 

~ t v for 2 < d < d c (N) 
~ const, for d > d c (N) 
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where the exponent v = TV — rf(TV — l)/2 varies with TV and 
d. Exactly at the two critical dimension, there are logarithmic 
corrections. In particular, for large t, W^{t) ~ t/\hxt] N in 
d = 2, and W N (t) - latin d = d c (N) (with TV > 1). 
The existence of the intermediate phase 2 < d < d c (TV) — 
2N I (TV — 1), with a growth exponent v varying with TV and 
d, is perhaps the most striking of our results. For instance, for 
TV = 2 we have d c (2) = 4 and so in 2 < d = 3 < 4 our 
result predicts v = 1/2, i.e., W%(Jb) ~ i 1 / 2 , a prediction that 
is verified in our numerical simulations. 

The statistics of the number of most popular sites, i.e., the 
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sites visited by all the walkers arises quite naturally in a num- 
ber of contexts such as sociology, ecology, artificial networks 
(e.g., internet, transport and engineering networks) and poly- 
mer networks just to name a few. For example in a multi- 
ple user network such as the internet, the most popular 'hub' 
sites visited by all the users are known to play very important 
role in the dissemination of information [27]. The knowledge 
of how many of them are there is fundamental for many ap- 
plications. In tourism industry, it is important to know the 
number of most popular sites in a given area or city that are 
visited by all the tourists. Motivated by this general question, 
in this Letter we study the statistics of the number of most 
popular sites in perhaps the simplest model, namely for N 
independent random walkers in the e?-dimensional space and 
show that even in this simple model, the asymptotic temporal 
growth of the mean number of common sites frequented by 
all N walkers exhibits surprisingly rich behavior. We show 
that our results also have close connections to the probability 
of non-intersection of random walks studied in the mathemat- 
ics literature 1^8, 29]. Given the abundance of random walks 
used as a fundamental model to study numerous natural and 
artificial systems, and the richness of our exact results, we be- 
lieve that they will be useful in more specific applications in 
the future. 

We consider N independent i-step walkers on a d- 
dimensional lattice, each starting at the origin. To compute 
the number of common sites visited by all the N walkers, it is 
first useful to introduce a binary random variable ak,N(x,t) 
associated with each site x such that <Jk,N{x, t) = 1 if the site 
x is visited by exactly k of the TV walkers and ak,N{x, t) = 
otherwise. Then the sum Vk,N(t) — J2s a k.N(x,t) repre- 
sents the number of sites visited by exactly k of the N walkers 
each of t steps in a particular realization of the walks. Clearly, 
Vfe,iv(£) is a random variable that fluctuates from one sample 
to another. Taking average gives the mean number of sites vis- 
ited by exactly k walkers, (Vk,N{t)) = Yls Pk,N{x, t) where 
Pk.N(x, t) = ((Tfejv(a;, t)) is the probability that the site x is 
visited by exactly k of the N walkers. Since the walkers are 
independent, one can write 



Pk,N{x, t) 



[p(x,t)f [l-p(x,t)} 



N-k 



(2) 



where p(x, t) is the probability that the site x is visited by a 
single i-step walker starting at the origin. Thus 



MA*)) = ( ) E w,tf [i~p{x,t)f- k . o) 



Finally, the mean number of common sites visited by all the 
N walkers is simply 



w N (t) = (v N , N (t)) =X>(*>*)] 



N 



(4) 



Hence, once the basic quantity p(x, t) for a single walker is 
known, we can determine (Vk t isr(t)) and in particular Ww(t) 



just by summing over all sites as in Eq. (0). Note that, by 
definition, p(0, t) — 1 for all t > since the walker starts at 
the origin. 

The probability p(x, t) can be fully determined for a lattice 
walker with discrete time steps using the standard generating 
function technique (3J. However, since we are interested here 
mainly in the asymptotic large t regime, it is much easier to 
work directly in the continuum limit where we treat both space 
x and time t as continuous variables. Consider then a single 
Brownian motion of length t and diffusion constant D in d- 
dimensions starting at the origin. We are interested in p(x, t), 
the probability that the site x is visited (at least once) by the 
walker up to time t. Let r denote the last time before t that 
the site x was visited by the walker. Then, clearly 



p(x,t) 



G{x, t) q{t 



i dr 



(5) 



where G(x,t) = e'* 2 ' ADt / {An D t) d / 2 (where x = \x\) is 
the standard Green's function denoting the probability that the 
particle is at x at time r and q(r) denotes the persistence, i.e., 
the probability that starting at x, the walker does not return to 
its starting point up to time t. Note that q(r) does not depend 
on the starting point x and is the same as the probability of no 
return to the origin up to time r. Indeed, q{r) = /(t')cJt' 
where /(r) = —dq/dr is the standard first-passage probabil- 
ity to the origin [30]. 

The no-return probability q(r) for a Brownian walker has 
been studied extensively and it is well known that for large r, 
q(r) - t' 1 / 2 - 1 for d < 2, q(r) ~ 1/lnr for d = 2, while it 
approaches a constant for d > 2 since the walker can escape to 
infinity with a finite probability for d > 2 13011 . One can show 
that to analyze the large t behavior of p(x, t) in Eq. <(5j in 
the scaling regime where x — > oo, t —} oo but keeping xj '\ft 
fixed, it suffices to substitute only the asymptotic behavior of 
q(r) in Eq. (0. This gives, for large t 



p(2,*)~ / G(2,T)(t-T) d/a - 1 dTford<2 (6) 
Jo 

p(x, t) ~ / G(x, r)dr for d > 2 (7) 

where we have dropped unimportant constants for conve- 
nience. For d — 2, p{x,t) ~ f Q G(x, r)dr/ln(t — r). 
Substituting the exact Green's function G(x, r) = 
e ~x 2 /4Dr^ A7rDT y/2 one finds that p (£ )f ) has the 

following asymptotic scaling behavior 



p(x, t) a t 1 ^ 2 /> (-/=) for d > 2 (9) 
where the scaling functions for d < 2 and d > 2 can be ex- 
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W N (t) ~ const 



d c (N) = 2N/(N-l) 



N 
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FIG. 2. (color online) In the (N — d) plane, there are two critical 
lines d = 2 (lower horizontal line) and d c (N) = 2N/(N - 1) (up- 
per dashed (red) curve). The mean number of common sites Wjv(t) 
visited by N walkers, each of t steps and all starting at the origin at 
t = 0, have different asymptotic behaviors for large t in the three 
regimes d < 2, 2 < d < d c (N) and d > d c (N). 



pressed explicitly as 

/< (z) = f e~ z2 ' u u- d ' 2 (1 - u) d ' 2 - 1 du (10) 
Jo 

/>(*)= / e-* 2 ! u u- d ' 2 du. (11) 
Jo 

Exactly at d = 2, one gets p(x,t) s» (l/lni)/ 2 

where /2(z) = J^due~ z / u /u. 

It is easy to derive the asymptotic tails of the scaling func 
tions. One finds 



/< (z) w const. asz->0 



OO 



and 



/>0) z" (d - 2) asz^O 



as z — > oo 



(12) 



(13) 



At d = 2, one finds f2(z) ~ — 21n(z) as z — >• and /2(z) ~ 
e _z /z 2 as z — ► ex). Note that the scaling forms postulated 
in Eqs. (0 and © do not, in general, hold for very small x. 
For d < 2, the scaling regime can actually be extended all 
the way to x — > and indeed, the exact relation p(0, t) = 1 
is actually part of the scaling regime. This is seen by taking 
x — > limit in Eq. (© and using the asymptotic small z 
behavior of /< (z) in Eq. (fT2l . In contrast, for d > 2, one can 
not recover p(0, t) = 1 by taking x — > limit in Eq. (|9). This 
is a manifestation of the fact that for d > 2 one always needs a 
finite lattice cut-off a > (see, e.g. 04[]). Thus for d > 2, the 
continuum scaling result in Eq. (0 does not hold for x < a. 

We next substitute Eqs. (0 and (O in Eq. and replace 
the sum by an integral over space. Note that even though we 



started out with d and N being integers, the general formula 
dU can be analytically continued to real d > and real N > 0. 
So, from now on we will consider d and N to be continuous 
real positive numbers as, e.g., represented in the phase dia- 
gram in Fig. (O. Indeed, non-integer values of d can be in- 
terpreted in terms of random walks on fractal manifolds with 
non-integer dimensions. Consider first the case d < 2 where 
we get, dropping unimportant prefactors, for large t 



W N (t)~t d / 2 



[f<(z)f 



dz . 



(14) 



From the tails of the scaling function /< (z) in Eq. (fT2l . it is 
evident that the integral in Eq. (fl4l is convergent and is just 
a constant and hence for d < 2, Wjsi{t) ~ &jv t d ^ 2 for large t 
with only the prefactor fc/v, but not the exponent, depending on 

N. Exactly at d = 2, using p(x,t) - [1/lnt] / 2 (fx/\/4Di 
and following a similar analysis we get for large t 



[/ 2 (z)r«<k 



(15) 



Using the exact form of the scaling function /2(z) described 
before, one can check that the integral above is convergent and 
hence, for d = 2, W N {t) ~ t/[hit} N for large t. 

For d > 2, a similar manipulation is a bit more delicate. 
We recall that the scaling result for p(x, t) in Eq. (O holds 
only for x > a where a is a lattice cut-off, while p(0, t) = 1 
identically. Thus, in the sum in Eq. we separate x = 
term and replace the rest of the sum by an integral over the 
scaling form 



1 A ? 



.d-l 



dx 



W N {t)^l + A d t N ^- d '^ 

./,, \ \ -i-ui / 

(16) 

where is a volume dependent constant and a is the lattice 
cut-off. This gives, after rescaling z = xj V 'ADt 



W N {t)^l + A d t N -^ N -^ d ' 2 



a/ViDt 



[/>(*)] 



TV 



Z^^Z. 



(17) 

We now have to check how the integral behaves as t — > oo, 
i.e., its lower limit approaches 0. This is controlled by the 
small z behavior of the integrand. From Eq. (QjJ, we get 



[/>(*)] 



N 



_ Ar ( d ^) as z ^ 0. Hence the integrand be- 
haves as z d -(d—2)N-i as z _^ q Thus two situations arise. If 
d-(d-2)N > 0,U.,d< d c (N) = 2N/{N- 1) (recall that 
d > 2 already), the integral is convergent at the lower limit 
and one can safely take the t — > oo limit and then Eq. ( TP71 ) 
predicts that for large t and 2 < d < d c (N) = 2N/(N - 1) 



W N {t)~f; v = N -d{N -l)/2. 



(18) 



In contrast, if d-(d-2)N < 0, i.e., d > d c (N) = 2N/(N- 
1), the lower limit of the integral behaves as ~ ^(N-i)d/2-N 
for large t which precisely cancels the power-law prefactor 
and 



W N {t) -> const.; d > d c {N) = 2N/(N - 1) 



(19) 
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FIG. 3. (color online) Numerical results for Wrf(t) vs. t for different values of TV and d: (a) d = 1, the black lines (from top to bottom) 
correspond to Wjv(i) vs. t for TV = 1,2, ... ,7 averaged over 100000 realisations, the dashed (red) lines have slope 1/2; (b) In d — 2, we 
plot ajv Wjv(t) put] vs. t for iV = 1, 2, 3, 4 (from bottom to top) averaged over 100000 realisations. The prefactor ajv is chosen so that 
the curves start at the same point to make the visualisation better, (c) d = 3, the black lines correspond to N = 1, 2, 3, 4 (from top to bottom) 
averaged over 7000 realisations. Analytical results predict W N (t) ~ t {3 ~ N)/2 for TV < 3, Wnr(t) ~ lnt for TV = 3 and Wjv(t) ~ const, 
for TV* = 4. The dashed (red) lines have slopes 1 (TV = 1), 1/2 (TV = 2), and (TV = 4), the dotted (blue) line is proportional to In t (TV = 3). 



where the constant evidently depends on the cut-off, i.e., on 
the details of the lattice and is thus nonuniversal. Physically 
this means that for d > d c (N), the common sites visited by 
all the walkers are typically close to the origin and are visited 
at relatively early times. At late times, the walkers hardly 
overlap and hence Wjv(t) does not grow with time. Finally, 
exactly at d = d c (N), a similar analysis shows that W^{t) ~ 
ln(t) for large t. The upper phase boundary in Fig. (|2) depicts 
the critical line d c (N) = 2N/(N - 1) as a function of TV. 
Alternatively, for fixed 2 < d < d c (N), this critical line can 
also be described as TV c (d) = d/(d-2). For 1 < TV < N c (d), 
we have W N (t) ~ t v with v = TV - d(N -1)1% 

To check our analytical predictions, we have computed 
Wjv(t) numerically for d = 1, 2, 3 and for several values 
of TV. In d = 1, our result predicts that W N {t) — b N t 1/2 
for large t where the exponent 1/2 is independent of TV and 
only the prefactor 6at depends on TV. The results in Fig. (3a) 
are consistent with this prediction. In d = 2, our results pre- 
dict that Wjy(t) ~ t/[lnt] which is verified numerically in 
Fig. (3b). For d = 3, our result predicts that there is a crit- 
ical value TV C = 3 such that W N {t) ~ t^- N ~>/ 2 for TV < 3, 
W N {t) ~ ln(t) for TV = 3 and W N (t) ~ const, for TV > 3. 
The simulation results for d = 3 in Fig. (3c) are consistent 
with these predictions. 

Interestingly, the critical dimension d c (N) = 2TV/ (TV — 1) 
has also appeared in the probability literature [29] in the con- 
text of the probability of no intersection of TV random walkers 
up to t steps all starting at the origin Il28ll . To make a pre- 



cise connection with our work presented here, consider the 
random variable Vjv.jvW tnat denotes the number of com- 
mon sites visited by all the TV walkers up to t steps. Since 
all the walkers start at the origin, clearly the number of com- 
mon sites visited must be at least 1 implying Vjv,jv(t) > !• 
When V/v Ar(t) = 1, it corresponds to the event that the 
walkers do not intersect further up to step t and the origin 
at t = remains the only site visited by all of them up to 
step t. Thus, the probability of no further intersection up to 
step t is Fjv(t) = Prob.[Vjv,jv(t) = !]• Lawler studied the 
decay of F/v (t) for large t rigorously in special cases 112 811 and 
Duplantier showed 112911 that Fjv(t) approaches a constant as 
t -> oo for d > d c {N) = 2TV/(TV - 1). For d < d c {N), 
•Fjv(t) ~ and the exponent £ was computed using an e 
expansion around the critical dimension f|29ll. In contrast, in 
this Letter we have computed the mean of the random variable 
V N<N (t), i.e., W N {t) = (V N . N (t)). Note that while F N (t) is 
not exactly computable in all d, Wn (t) is, as we have shown 
here. 

Another interesting related problem is to compute the mean 
number of TV-fold self intersections of a single ideal poly- 
mer chain of length t. In Ref. 13111 . it was stated that in 
d = 3 this grows as t( 3-Ar )/ 2 , which looks similar to our re- 
sult Wjv(t) ~ £( 3-Ar )/ 2 in the intermediate phase in d = 3 
and for 1 < TV < 3. However, the two problems are not 
exactly identical and even the single chain result in Ref. 113 111 
was qualitatively argued for, not rigorously proved, and the 
logarithmic correction for TV = 3 was not mentioned. 
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In summary, we have presented exact asymptotic results for 
the mean number of common sites Wn (t) visited by TV inde- 
pendent random walkers in d dimensions. We have shown that 



as a function of TV and d in the (TV — d) plane, there are three 
distinct regimes for the growth of Wj\r(£)> including in partic- 
ular, an anomalous intermediate regime 2 < d < d c (N) = 
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2NJ (N — 1). There are several directions in which our work 
can be generalized. For instance, it would be easy to compute 
the mean number of sites visited exactly by k walkers (out of 
N) up to time t using our result in Eq. (fJJ. Here we have 
restricted only to the k = N case for simplicity. It would be 
interesting to consider cases where the walkers have differ- 
ent step lengths or when they start at different positions 113211 . 
Also, computing the full distribution of the number of com- 
mon sites visited by all walkers remains a challenging open 
problem. 
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